Natural Language with Integrated Deictic and Graphic Gestures
Authors
Abstract
People frequently and effectively integrate deictic and graphic gestures with their natural language (NL) when conducting human-to-human dialogue. Similar multi-modal communication can facilitate human interaction with modern sophisticated information processing and decision-aiding computer systems. As part of the CUBRICON project, we are developing NL processing technology that incorporates deictic and graphic gestures with simultaneous coordinated NL for both user inputs and system-generated outputs. Such multi-modal language should be natural and efficient for human-computer dialogue, particularly for presenting or requesting information about objects that are visible, or can be presented visibly, on a graphics display. This paper discusses unique interface capabilities that the CUBRICON system provides, including the ability to: (1) accept and understand multi-media input such that references to entities in (spoken or typed) natural language sentences can include coordinated simultaneous pointing to the respective entities on a graphics display; use simultaneous pointing and NL references to disambiguate one another when appropriate; infer the intended referent of a point gesture which is inconsistent with the accompanying NL; (2) dynamically compose and generate multi-modal language that combines NL with deictic gestures and graphic expressions; synchronously present the spoken natural language and coordinated pointing gestures and graphic expressions; discriminate between spoken and written NL.

1 INTRODUCTION

One of the strong arguments in favor of using Natural Language (NL) processing systems as front-ends to sophisticated application systems is that if human-computer communication is conducted in an NL that most users know, then the cost of training a user to use the system

1 This research was supported, in part, by the Defense Advanced Research Projects Agency and monitored by the Rome Air Development Center under Contract No. F30603-87-C-0136 and the National Science Foundation grant No. SES-88-10917 to The National Center for Geographic Information and Analysis.
2 Calspan Corporation
3 State University of New York at Buffalo
Similar resources
Interaction of Speech, Deixis and Graphical Interface
To solve certain problems of multimodal interaction the concept of graphical utterances is introduced. Two different functions of deictic gestures are discussed: deictic gestures may be used to focus on a certain context of interpretation and they may be used to provide a referent for a natural language expression. The relations between deictic gestures and visual utterances are presented. Pro...
Active And Passive Gestures - Problems With The Resolution Of Deictic And Elliptic Expressions In A Multimodal System
This paper deals with aspects of the resolution of deictic and elliptic expressions that are related to gestures. It discusses different approaches to distinguish between deictic pointing and manipulative gestures. We compare two strategies of combining natural multimodal communication with direct manipulation. The first approach uses click free mouse gestures for deictic pointing, while manipu...
Gestures of a virtual guide
This thesis describes a research project on the development of a virtual guide resulting in an Embodied Conversational Agent (ECA), which means it is capable of interacting with users through verbal and nonverbal communication. Since an ECA requires a lot of functionality, equal to the combined functionality of a dialogue system, a multimodal interface and a software agent [CSPC00], existing so...
Combining Deictic Gestures and Natural Language for Referent Identification
In virtually all current natural-language dialog systems, users can only refer to objects by using linguistic descriptions. However, in human face-to-face conversation, participants frequently use various sorts of deictic gestures as well. In this paper, we will present the referent identification component of XTRA, a system for a natural-language access to expert systems. XTRA allows ...
A Corpus of Natural Multimodal Spatial Scene Descriptions
We present a corpus of multimodal spatial descriptions, as commonly occurring in route giving tasks. Participants provided natural spatial scene descriptions with speech and abstract deictic/iconic hand gestures. The scenes were composed of simple geometric objects. While the language denotes object shape and visual properties (e.g., colour), the abstract deictic gestures “placed” objects in ge...
Publication date: 1989